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Abstract 

Many real life networks present an average path length logarithmic with the 
number of nodes and a degree distribution which follows a power law. Of- 
ten these networks have also a modular and self-similar structure and, in 
some cases - usually associated with topological restrictions- their clustering 
is low and they are almost planar. In this paper we introduce a family of 
graphs which share all these properties and are defined by two parameters. 
As their construction is deterministic, we obtain exact analytic expressions 
for relevant properties of the graphs including the degree distribution, degree 
correlation, diameter, and average distance, as a function of the two defining 
parameters. Thus, the graphs are useful to model some complex networks, in 
particular several families of technological and biological networks, and in the 
design of new practical communication algorithms in relation to their dynam- 
ical processes. They can also help understanding the underlying mechanisms 
that have produced their particular structure. 
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1. Introduction 



Ten years have past since the publication of the groundbreaking papers by 
Watts and Strogatz [l[ on small-world networks and Barabasi and Albert 0] 
on scale-free networks. Their works led researchers to the design of new 
network models to describe complex systems in nature and society like the 
Internet, protein-protein interactions, transportation systems or social and 
economic networks. Their models try to match observational studies which 
have identified at least three important common characteristics for real-life 
networks: They exhibit a small average distance and diameter (compared to 
a random network with the same number of nodes and links); the number 
of links attached to the nodes obeys a power-law distribution (the networks 
are scale-free); and recently it has been discovered that, often, real networks 
are self-similar [3|1 and show a degree hierarchy related to the modularity of 
the system, see [H, 0, @] . 

Many of the proposed models are stochastic as this is the case for the 
now classical preferential attachment method 0]. Thus, the use of mean 
field techniques is required to estimate the main parameters of a network [t| . 
However, a deterministic approach has proven useful to complement and 
enhance the probabilistic and simulation techniques. Deterministic models 
have a clear advantage, as they allow an analytical exact determination of 
relevant network parameters, which then can be compared with experimental 
data coming from real and simulated networks 

Among the different methods known to generate deterministic models 
those based on recursive or iterative methods are of particular interest. In 
these methods, new nodes are added and connected to a given substructure 
of the network at each generation step. This is the case for pseudo-fractal 
networks where, at each step, new vertices are added simultaneously, 
one to each already existing link. This construction can be generalized if 
complete subgraphs of a given size (cliques) are considered instead of links 
(which are of course 2-cliques), see Similar rules give the Apollonian 
networks I 111 . [I2j. On the other hand, there also exist techniques that 



produce networks by duplication of a given substructure, see 0, H] . 

A generalization of these two methods introduces at each iteration a 
substructure which is added to the network, according to a deterministic 



rule. Substructures that have been used are triangles [15j, cycles [16( and 



paths 17 



In this paper we go one step further by considering the simultaneous 
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introduction of d substructures in parallel -in our case, paths- which are at- 
tached to the same basic unit (a link) generalizing the model given in 171 ]. 
which added a single path to each link. The resulting graphs are essen- 
tially different from those in [l7| . In particular they are scale-free (with a 
power-law exponent which depends on d) while in 17j the degree distribu- 
tion is exponential. The model is a family of planar, modular, hierarchical 
and self-similar networks, with small- world scale-free characteristics and with 
clustering coefficient zero, and all these parameter are determined by d as 
well as by the iteration step t. We note that some important real life net- 
works, for example those associated to electronic circuits, Internet and some 



biological systems [18|, [19], have these characteristics as they are modular, 
almost planar and with a reduced clustering coefficient and have small-world 
and scale-free properties. Thus, these networks are modeled by our construc- 
tion which can be considered as a new tool in the study of their associated 
complex systems. In particular, the model could be used to find also prac- 
tical algorithms in relation to dynamical processes (synchronization, cover 
time, etc.) for these technological and biological networks and can help un- 
derstanding the underlying mechanisms that have produced their particular 
structure. 

In the next section we introduce the family of graphs object of study 
and in Section [3] we calculate analytically some relevant properties for the 
graphs, namely, the degree distribution, degree correlations, the diameter 
and the average distance. The last section provides some conclusions. 



2. Generation of the graphs M d (t) 

In this section we introduce a family of modular, self-similar and planar 
graphs which have the small-world property and are scale-free. The fam- 
ily depends on an adjustable parameter d and the iteration number t. We 
provide an iterative algorithm, and also a recursive method, for its construc- 
tion. The construction methods allow a direct determination of the number 
of vertices (nodes) and edges (links) of the graph. 

Iterative construction. - We give here an iterative formal definition of the 
proposed family of graphs, M^(t), characterized by t > 0, the number of itera- 
tions and a parameter d associated with the self-repeating modular structure. 

First, we call generating edges the only edge of Md(0) and all edges of 
Md(t) whose endvertices have been introduced at different iteration steps t. 
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All other edges of Md(t) will be known as passive edges. A generating edge 
becomes passive after its use in the construction. 
The graph Md(t) is constructed as follows: 

For t — 0, Mrf(0) has two vertices and a generating edge connecting them. 

For t > 1, Md(t) is obtained from Md(t— 1) by adding, to every generating 
edge in M d (t — 1), d parallel paths of length three (each path has four vertices 
and three edges) by identifying the two final vertices of each path with the 
endvertices of the generating edge. 

The process is repeated until the desired number of vertices is reached, 
see Fig. [IJ We note that the number of vertices can be also adjusted with the 
parameter d (number of parallel paths that are attached to each generating 
edge) . 

Recursive modular construction. - The graph M^it) can also be defined as 
follows: 

(a) For t = 0, M^(0) has two vertices and a generating edge connecting them. 

(b) For t = 1, Md(l) is obtained from M^(0) by adding to its only edge d 
parallel paths of length three by identifying the two final vertices of each 
path with the endvertices of the initial edge. 

(c) For t > 2, Md(t) is made from 2d copies of Md(t — 1), by identifying, 
vertex to vertex, the initial edge of each Md(t — 1) with the generating edges 
of M d (l), see Fig. d 

Number of vertices and edges of Md(t).- We use the following notation: 
V(t), E(t) and E g (t) denote, respectively, the set of vertices, edges and gen- 
erating edges introduced at step t, while V(t) and E(t) denote the set of 
vertices and edges of the graph Mdit). 

Notice that, at each iteration, a generating edge is replaced by 2d new 
generating edges and d passive edges. Therefore: \E g (t + 1)| = 2d ■ \E g (t)\, 
and \E g (t)\ = {2d) 1 . As each generating edge introduces at the next iteration 
2d new vertices and 3d new edges we have \V(t + 1)| = 2d ■ \E^(t)\ = (2d) t+1 
and \E(t + 1)| = 3d ■ \E g (t)\ = 3d ■ (2d) 1 . As |F(0)| = 2 and \E g (0)\ = 1, the 
number of vertices and edges of M(t), t > 0, is: 

\E(t)\ = hm\ = 3d{2 Z-l~ 1 - w 



i=0 
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Figure 1: Graphs Md{t) produced at iterations t = 0, 1, 2 and 3 for d = 2. 
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Planarity. - A graph is planar if it can be drawn on the plane with no edges 
crossing. By construction of M d (t), the introduction at each iteration of d 
parallel paths connected to each generating edge, which afterwards becomes 
passive, adds 2d new vertices to the graph and they can be drawn without 
crossing edges. Planarity could also be proven from Kuratowski's theorem 
or from the known planarity test which states that a graph is planar if it has 
no cycles of length 3 and \E\ < 2\V\ — 4, \V\ > 3, see 2^ ■ 



3. Topological properties of Md(t) 

Thanks to the deterministic nature of the graphs Md(t), we can give exact 
values for the relevant topological properties of this graph family, namely, 
the degree distribution, degree correlations, the diameter and the average 
distance. 

Degree distribution.- Initially, at t = 0, the graph has two vertices of 
degree one. When a new vertex % is added to the graph at iteration ti, this 
vertex has degree 2 and it is connected to only one generating edge. We 
use the following notation: k g (i,t), k p (i,t) and k(i,t) are, respectively, the 
number of generating edges, passive edges and total edges connected to vertex 
i, at step t > ti. Therefore k(i, t) = k g (i, t) + k p (i, t) is the degree of vertex i 
at this step. 

From the construction process we can write, 

k p (i,t + 1) = k p (i,t) + k g (i,t) 

k g (i,t + l) = dk g (i,t) 1 ' 

with the initial conditions, 

kg(i, ti) 
kpii-i t^) 

and for d > 1 we have, 

k g (i,t) = <f"*% ti > and 

/ 1 + ^Et if *» = 
kp[ht) ~ \ 2 + ^ otherwise. (4) 

All the vertices that have been introduced at step ti have the same degree 
at step t: 



1, ti > and 

jo if u = o (3) 

I 1 otherwise 
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1. The two vertices introduced at step t j = have degree, 



k(i,t) = k g (i,t) + k p (i,t) = 

Jt i d*-d d m -l 

2. The |V"(ti)| = (2d) ti vertices introduced at step U > have degree, 

k{i,t) = k g (i,t) + k p (i,t) = 

, , d l ~ u - d dd l ~ u - 1 

= ^'• + 2 + ^3^ = 1 + ^3^- <«) 

Therefore the degree spectrum of the graph is discrete and to relate the 
exponent of this discrete degree distribution to the power law exponent of 
a continuous degree distribution for random scale free networks, we use the 
technique described by Newman in to find the cumulative degree distri- 
bution P cum (k). If we denote by V(t, k) the set of vertices that have degree 
k at step t, 



Pcma\k) 



J2 k ,> k \V(t,k')\ _ 2 + E*uW- 

(2d)«+i+2d-2 

(2d) u+1 + 2d-2 
[2d) t+l + 2d-2 " 

ln(fc+2^*_l) 

(2d) 1 e& +1 + 2d - 2 
(2d) t + 1 + 2d - 2 ' 



Fot t large, we obtain, 



lnffc+S-^l-l) O _ b ln(2d) 

P cum (k) » (2d) i=BJ— = (*;+— —-lpHGf 

d 

Wgd) . 1 2 — d. ln(2d) 
= ln(d) Q 1 ) ln(d) _ 

d fed 

For >> 1 this expression gives 



ln(2d) | ln(2d) 

Pcnm{k) « A;-W(l--)-T=BT (7) 
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Figure 2: Log-log representation of the cumulative degree distribution for Md(t), d 
2, 3, 4, 5. The reference line has slope -2. 



Thus, the degree distribution follows a power-law 



Pcwx(k) ~ k 7 , with 7 



ln(2d) 



and therefore the degree distribution is scale-free, see Fig. [2j 

Research on networks associated to electronic circuits show that many 
of them are almost planar, modular and have a small clustering coefficient 
and in most cases their degree distributions follow a power-law 18|, ll9] with 
exponent values in the same range than those of Md(t). 

Correlation coefficient. - We have obtained the Pearson correlation coef- 
ficient [2lj], r(d,t), for the degrees of the endvertices of the edges of M d {t). 
In Appendix A we present the details of the calculation that leads to the its 
exact analytical expression as shown in Eq. [TTJ We particularize this general 
analytical result for different instances of the graphs, obtaining numerical 
values of the correlation as shown in Table [1] . 

From the analytical results and the numerical values of the correlation 
coefficient we see that this family of graphs has the degrees of the endvertices 
negatively correlated (large degree vertices tend to be connected with low 
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t = 1 


t = 2 


t = 3 


t = 10 


d = 2 


-0.1667 


-0.0886 


-0.0460 


-0.0003 


d=10 


-0.4091 


-0.2338 


-0.1174 


-0.0009 


d= 100 


-0.4901 


-0.2057 


-0.0934 


-0.0007 



For d » 1, we obtain r(d,t) ~ — — — -, which for t large gives 

)t 1 



Table 1: Correlation coefficient at steps t = 1,2, 3, 10 for several values of d. 

degree vertices) and the graphs are disassortative, as it occurs with many 
technological and biological networks (l9| . 

1 

1 — 3-2 

r(d,t)~0. 

Diameter.- At each iteration step we introduce, for every generating edge, 
2d new vertices. These vertices are among them at distance at most 3. As 
each vertex joins the graph of the former step through one new edge, the 
diameter will increase by exactly 2 units. Therefore D(t) = D(t — 1) + 2, 
t > 2. AsD(l) = 3, we have that the diameter of M d (t) is D(t) = 3+2-(t— 1), 
t > 1. Therefore, from Eq. [TJ and as for t large, t ~ In |V(£)| we have in this 
limit that D(t) ~ In \V(t)\. 

Average distance. - The average distance of Md(t) is defined as: 

D{t) = ]T dkj, (8) 

|V(t)|(|V(t)|-l)/2. 

where d^j is the distance between vertices i and j. 

In Appendix B we use the modular recursive construction of Md(t) to 
calculate the exact value of D(t) which results: 

D(t) = (-1 +4d-5d 2 + 2d 3 + 2 1+t d 1+t -7 -2 2t d 2+2t + 
+ 3 • 2 1+2t d 3+2t - 2 l+t d l+t t + 3 ■ 2 1+t d 2+t t - 
- 2 2+t d 3+t t - 2 1+2t d 2+2t t + 2 2+2t d 3+2t t) 
I ((-1 + 2d){-\ + d + 2 t d 1+t )(-l + 2 1+t d 1+t )). 

(9) 

Notice that for a large iteration step, t — >• oo, ~ t ~ ln|V(t)|, 

which shows a logarithmic scaling of the average distance with the number 
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of vertices of the graph. As we have a similar behavior for the diameter, the 
graph is small-world. 



4. Md(t) as a model for some technical and biological networks 

The graphs introduced here have parameters which are similar to those of 
some real life networks. A good example is the largest benchmark considered 
in [HI -a network with 24097 nodes, 53248 edges, average degree 4.34 and 
average distance 11.05- has a degree distribution which follows a power-law 
with exponent 3.0, and it has a small clustering coefficient C = 0.01 and 
other network properties are also in the same range than those of the graph 
Mq(4), see 19j. Table [2] compares some network parameters from instances 
of our model with data coming from real networks published elsewhere. Al- 
though there are many similarities between the two sets, the aim of this 
model is not to match perfectly all the network parameters for some real life 
complex systems, but to provide an analytical framework where to perform 
precise tests of new algorithms (routing, synchronization, etc.) and check 
properties that otherwise would require less general and precise techniques 
like simulation of stochastic methods. 



Network 


Vertices 


Edges 


7 


Avg. 


Clust. 


Avg. 


r 


Ref(s). 










dist. 




degree 






M a (5) 


1366 


2047 


3 


6.850 





2.997 


-0.001 




Java Dev. Fram. 


1376 


2174 


2.5 


6.39 


0.06 


3.160 


-0.002 


[22,19] 


M 6 (2) 


158 


235 


2.39 


3.290 





2.975 


-0.233 




Silwood Pk food web 


154 


366 


1 


3.4 


0.15 


4.75 


-0.31 


[23] 


M 6 (3) 


1886 


2827 


2.39 


4.474 





2.998 


-0.130 




•protein inter. S. C. 


2115 


2240 


2.4 


6.80 


0.071 


2.089 


-0.156 


[24, 19] 


M 6 (4) 


22622 


33931 


2.39 


5.557 





3.000 


-0.007 




electronic circuits 


24097 


53248 


2.39 


11.05 


0.01 


4.34 


-0.130 


[18,19] 


M 8 (3) 


4370 


6553 


2.33 


4.482 





2.999 


-0.123 




power grid 


4941 


6594 




19.99 


0.1 


2.669 


-0.003 


[1, 19] 



Table 2: Some instances of Mj(t) and possible real network counterparts. 



5. Conclusion 

The graphs M d (t) introduced and studied here are planar, modular, have 
a disassortative degree hierarchy and are small-world and scale-free. Another 



10 



relevant characteristic of the graphs is their zero clustering. A combination 
of a low clustering coefficient, modularity, and small- world scale-free prop- 
erties can be found in some real networks, in particular in technological and 



biological networks |19l . Il8j , and most of them are also disassortative. 

Finally, we should emphasize that the planar property and the deter- 
ministic character of the family, in contrast with more usual probabilistic 
approaches, should facilitate the exact determination of other network pa- 
rameters and the development of new network algorithms that then might 
be extended to real-life complex systems. 
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APPENDICES 



A. Correlation coefficient calculation. 



The Pearson correlation coefficient, r(d,t), for the degrees of the endver- 
tices of the edges of M d (t) is: 

where ji, fcj are the degrees of the endvertices of the zth edge, with i = 



l,---,\E(t)\, see [21 



To calculate the correlation coefficient we need to know the degree distri- 
bution of the endvertices of the edges in E(U) at a given step U. We denote 
by 0'j k) an edge connecting vertices of degrees j and k. 

The detail of this distribution is given as follows: 

The edges introduced at step ti are: 

1. Edges (2,2), connecting two vertices introduced at step £, > 0. There 
are (2d) ti /2 edges (a half of the vertices introduced at step ti). Notice 
that there is one edge (1, 1) introduced at i; = 0. 
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2. Edges (2, k(i',ti)) connecting vertices of degree two, introduced at step 
tj, with all the vertices i' introduced at step ty with < ty < t« — 1. 
For each vertex i' there are k g (i', ti) edges: 

From the two vertices introduced at ty = 0, see (jSJ), there are 2d* 1 ' 
edges <2, 

From the (2d)**' vertices introduced at £ 4 / > 0, see ([2]), there are 
(2d)Vd**-V edges (2, 1 + ^ff^ ). 

Table [3] here displays a summary of the results. 



Step ti 


Edges at step £j 


Number 


Edges at step t > ti 


u = o 


(1,1) 


1 


id t+1 -l d t+1 -l\ 
\ d-1 ' d-1 / 


l<U<t 


(2,2) 

/O d f i + 1 -l\ 
\ Z ' d-1 / 


(2d) t i 
2 

2d u 


/ *j | dd*-*»-l -1 | dd*-**-l\ 
"r d-1 ' 1 1 d-1 / 

/i , d^-'i-l d t+1 -l\ 
\ d-1 ' d-1 / 


2<ti<t 

1 < u> < U - 1 


(2,1 + ^fF) 


(2d)**' • d**~**' 


/-i , dd'-'i-l I | dd*~V-l\ 

"T d-1 ' 1 1 d-1 / 



Table 3: Number of edges in Md(t) according to the degrees of their endvertices. 



Using these results, we can find the following sums: 



J^Jik = (4 + 16d - 51 d 2 + 41d 3 -8d 4 - 

i 

- 3d 5 + d 6 + d t+1 (A0 + 8t-80-2 t ) + 

+ d* +2 (-184 - 40i + 282-2*) + 

+ d* +3 (306 + 74t- 373-2*) + 

+ d i+4 (-236 - 64t + 227-2*) + 

+ ff* +5 (86 + 26t-63-2*) + 
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+ d t+6 (-12-4t + 7-2 t ) + 

+ d 2t+2 (10 + 4t) + d 2t+3 (-43 - 18*) + 

+ d 2t+4 (62 + 28t)+d 2t+5 (-35-18t) + 

+ d 2t+6 (6 + 4t))/((d- l) 3 
(2d 2 -5d + 2)(d-2)), 

+ ^ = (2d-l ) (/- 2 2 ) (d-l ) 2 ( - 2 - 3d + 

+ 10d 2 -6(i 3 + d 4 + d* +1 (-4 + 16-2 i ) + 

+ cf f2 (14-37-2 t ) + d t+3 (14 + 26-2') + 

+ d* +4 (4-5-2')-d 2 *+ 2 + 3d 2t+3 - 

- 2d 2t+4 ), 

+ kf ) = -2(-8 - 32d + 186d 2 - 282d 3 + 

i 

+ 145d 4 + 49d 5 -96(i 6 + 48(i 7 -lW 8 + 

+ d 9 + d* +1 (-72 + 160 • 2* ) + d t+2 (384 - 

- 728 • 2*) +d* +3 (-750 + 1252 • 2*) + 

+ d t+4 (606 - 882 • 2*) + d* +5 (-33 - 39 • 2*) + 

+ d* +6 (-285 + 456 • 2*) + d* +7 (201 - 286 • 2*) 

+ d* +8 (-57 + 74-2 i ) + d t+9 (6-7-2*) + 

+ 4d 3 '+ 2 - 16d 3t+4 + 17d 3t+5 + 5d 3t+6 - 

- 19d 3t+7 + lld 3t+8 - 2d 3t+9 )/((d 2 -2d+l) 
(2d - l)(d - 2)(d 3 - 2d 2 -2d + 4)(d - l) 2 ). 



Replacing these sums into equation (TT0[) we obtain, for any d, the exact 
analytical expression for the Pearson correlation coefficient of Md(t) which is 
displayed as Eq. [TTJ For d = 2 this equation becomes Eq. 



B. Analytical determination of the average distance. 

The average distance of M d (t) is defined as: 

D(t) = y d l7 , (13) 

|V(t)|(|V(t)|-l)/2 M .^ (t) J 
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r{d,t) 



(d(3 ■ 2 t d t -!)-!)■ (B(d, t) + C(d, t)) - (D(d, t) + E(d, t)) 2 
(d(3 • 2*rf* - 1) - 1) ■ F{d, t) - (D{d, t) + E{d, t)) 2 



,d > 2 
(11) 



Where: 

B(d,t) = {d 3 + 2 ■ d 2 - 6d - 1) + d t+1 2(2d - 1)((5 - 3d) + 

+ i(l - d)) + d t+2 (-28 • 2* + (-5 + 2t)d l ) + d t+3 (7 ■ 2* + (6 + 4t)d 4 ), 

C(d,t) = (d* +3 (31-2 t + (-ll + 6t)d*) + d* +2 (-90-2 t + 

+ (10 - 12i)c2*) + rf* +1 80 • 2*)/((d - 2) 2 ), 

D(d,t) = d 2 - 3d - 1 + d*(6 • 2* - 6d*) + <i t+1 (ll • 2* - 3d* - 2) + d t+2 (-5 ■ 2* - 2d* + 4), 

E(d,t) = -12d*(d*-2*)/(d-2) ) 

F(d, i) = (d 5+t (-6 + 7 • 2* + 2d 2t ) + d 4+ * (21 - 32 • 2* + d 2t ) + d 3+ * (3 + 3 • 2* - d 2t ) + 

+ d 2+t (-42 + 62 • 2*) + d 1+t (18 - 40 • 2 4 ) - d 5 + 5d 4 - 5d 3 - lid 2 + Ud + 2)/{d 2 - 2). 



where dij is the distance between vertices i and j. In what follows, S(t) will 
denote the sum Ylijevct) dij- 

The modular recursive construction of Md(t) allows us to calculate the 
exact value of D(t). At step t, Md(t+1) is obtained from the juxtaposition of 
2d copies of Md(i), which we label M^', rj — 1, 2, • • • , 2d, see Figures [1] and [3j 

Whenever possible, we drop the subscript d and represent M^} as to 
keep the notation uncluttered. The copies are connected one to another at 
2d + 2 vertices which we call connecting vertices. Two of them are the initial 
two vertices of the graph, which will be denoted in this section as A and B. 

In Fig. Owe display A, B and four more of these vertices, denoted as u, 
v, w and x. Note that in this figure, and for the sake of clarity, each copy of 
Md(t) has been represented as a rectangle, and only its connecting vertices 
have been drawn. 



For d = 2: 



r(2,f) 



AH 2 - 2 l +H + 3 ■ 2 2t+1 t - 2 3t +H + 13 ■ 4* - 3 ■ 2 t+1 + 4 2t+1 - 3 ■ 2 3t + 2 + 1 
2 4t+1 t 2 + 2 2t+1 t - 2 3t +H + 2 4t +H - 2* + 5 • 4* - 2 3t + 4 + 3 • 2 4t + 3 - 3 • 2 5t + 2 



(12) 
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Figure 3: Md{t + 1) is obtained from the juxtaposition of 2d copies of Md{t). 

Thus, the sum of distances S t +i satisfies the following recursion: 

S t+1 = 2dS t + A t . (14) 

where A t is the sum over all shortest path length whose endpoints are not in 
the same M t branch. 

To compute A t , we classify the vertices of M d (t + 1) into two categories: 
the two vertices with the largest degree (i.e., A and B in Fig. [3]) are called 
hubs, while all other vertex are called non-hub vertex. Thus At can be 
obtained by adding the following path lengths that are not included in the 
distance between vertex pairs of M t : length of the shortest paths between 
non-hub vertices, length of the shortest paths between a hub and non-hub 
vertices, and length of the shortest paths between hubs (for example, d uv , 
d U Bi and d ux ). 

Let us denote A" ,/3 as the sum of all shortest paths between non-hub 
vertices, whose endpoints are in and M t , respectively. Thus, A"'^ 

rules out the paths with endpoints at the connecting vertices belonging to 
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Mi or M t . For example, each path contributing to A]' 2 does not end at 
vertex u, v, A or B, and each path contributing to Aj' 4 does not end at vertex 
u, A, B or x. According to its value, A^ can be split into three classes, 
where the three representatives are A^' 2 , A^' 3 , and A*' 4 , and the cardinality 
of the three classes are d, d(d — 1), and d(d — 1), respectively. Analogously, 
the length of the shortest paths between a hub and all non-hub vertices can 
be classified into two classes, while the shortest paths between hubs can be 
partitioned into three classes with path lengths equal to 1, 2, or 3. 

Let be the set of non-hub vertices in Mj; a \ then the total sum A t is 
given by 

At = dAj' 2 + d{d - 1) (Aj' 3 + A] A ) + 2d{d + 1) 
d Aj + 2d(d-l) £ duj + 

+ (d + l)d uv + d(d + 1) d uw + d(d — 1) d ux , (15) 

where d uv = 1, d uw = 2, and d ux = 3 are easily seen. 

Having A t in terms of the quantities of A]' 2 , Aj' 3 , A^' 4 , Zljenf ^Aj Y,jeo? d U j, 
and d U j, the next step is to explicitly determine these quantities. To 

this end, we classify non-hub vertices in Md(t + 1) into two different parts 
according to their shortest path lengths to either of the two hubs (i.e. A and 
B). Notice that the vertices A and B themselves are not partitioned into 
either of the two parts represented as Pi and P2, respectively. The classifica- 
tion of vertices is shown in Fig. [3]). For any non-hub vertex <p, we denote the 
shortest path length from ip to A, B as a, and b, respectively. By construc- 
tion, a and b can differ at most by 1 since vertices A and B are adjacent. 
Then the classification function class((p) of vertex ip is defined to be 

/ ( \ i P ± for a < 6, , 
dass((p) = < „ . , (16) 

vyv [ P 2 for a > b. K ' 

It should be mentioned that the definition of the vertex classification is 
recursive. For instance, class Pi and P2 in M t belong to class Pi in Md(t+1), 
class Pi and P2 in M t belong to class P2 in M^it + 1), and so on. Since 
the two hubs A and B are symmetrical, in the graph we have the following 
equivalent relations from the viewpoint of class cardinality: classes Pi and P2 
are equivalent one to another. We denote the number of vertices in network 
Md(t) that belong to class Pi as N t) p x , and the number of vertices in class P2 
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as N t) p 2 . By symmetry, we have N t ^p 1 = N t ^p 2} which will be abbreviated as 
N t hereafter. It is easy to see that 



t — 



\V(t)\ 



2 



- 1 



d{2d) t - d 
2d - 1 



(17) 



For a vertex (p in Md(t + 1), we are also interested in the smallest value 
of the shortest path length from ip to either of the two hubs A and B. We 
denote the shortest distance as this value by f v , and it can be defined as 



Let 5( i p 1 (St,p 2 ) denote the sum of f v for all vertices belonging to class P\ 
(P 2 ) in Md(t). Again by symmetry, we have 5 fj p 1 = S t} p 2 that will be written as 
5t for short. Taking into account the recursive method of constructing Ma(t), 
we notice that the vertex classification follows also a recursion. Therefore we 
can write the following recursive formula for St+i'. 



Substituting equation ( fl7l) into equation ( fl9l) . and considering the initial 
condition 5q = 0, equation ( fl9i) is solved inductively 



min(a, b). 



(18) 



6 t+ i = 2d5 t + dN t + d. 



(19) 



2d - 2d 2 - 



(2d) 1+t + d(2d) 1+t 



dt(2dY + dt{2d) 



i+t 



(20) 



2(2d- l) 2 



We now return to compute equation ( 1151) . For convenience, we use T^' 1 to 
denote the set of non-hub vertices belonging to class Pi in M t . Then A]' 2 
can be written as 



A, 1 





+ X/ (dru + d u B + rffis) + ^ (rf rn + + d vs ) 




(21) 
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Analogously, we find 

A]> 3 = 8N t 5 t + 4(N t ) 2 (22) 

and 

A 1 / = 8N t 5 t + 8(N t ) 2 . (23) 

Next we will determine other quantities in equation (fl5j) . with J2jen^ d-Aj 
given by 

d Aj = ( d AB + d Bj ) + ( d Av + d vj ) 

= 2 5 t + 3N t . (24) 



Analogously, we can obtain 



J2 d uj = 25 t + 5N t . (25) 



Substituting equations (f2T]l. (l2"2j), (123"]), (121]) and (125]) into equation flU 
we have the final expression for cross distances A t , 

A t = l + 5d 2 + Ad(Ad-l)N t + Qd{2d-l)(N t ) 2 + 
+ 8d5 t [d+(2d-l)N t } = 

! [l-4d + 5d 2 -2d 3 + (l-d)(2d) 2+t + 



(l-2k) 2 

+ (5 + 2t)4 1+ *d 4+2 ' - (7 + 2t)d 2 (2d) 1+2t ). 

(26) 

Inserting equation (126]) into equation (1T41) and using the initial condition 
50 = 1, equation (1T4"]) is solved inductively, 

S t = j 2rf)3 (-1 + Ad - 5d 2 + 2d 3 + 2 1+t d l+t - 

- 7 ■ 2 2t d 2+2t + 3 • 2 1+2t d 3+2t - 2 1+t d l+t t + 

+ 3 • 2 1+t d 2+t t - 2 2+t d 3+t t - 2 1+2t d 2+2t t + 

+ 2 2+2t d 3+2t t). (27) 

Substituting equation (127]) into equation (ITBl yields the exact analytic ex- 
pression for the average distance of Md(t) as 

D(t) = (-l + 4d-5d 2 + 2d 3 + 2 1+t d l+t -7-2 2t d 2+2t + 
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+ 3 • 2 1+2t d 3+2t - 2 1+t d 1+t t + 3 • 2 1+t d 2+ H - 

- 2 2+t d 3+t t - 2 l+2t d 2+2t t + 2 2+2t d 3+2t t) 

I ((-1 +2d)(-l + d + 2 t d 1+t )(-l + 2 1+t d 1+t )). 

(28) 
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